Description

FIELD OF THE INVENTION 

The invention relates to a microprocessor comprising a processor element, a memory interface element, an IO
interface element, a debug support element and an internal bus interconnecting all above elements. Such
microprocessors have been in general use for executing computer programs in an environment of calculating,
controlling, signal processing and other tasks. A program must be checked for errors or other malfunctioning
during a debugging operation. Often, this is done by successively recording all addresses transferred via the
microprocessor bus. This recording can be done either by monitoring the internal bus directly, or by temporarily
logging such addresses for subsequent outputting and checking. Often, such procedures either take much time in a
step-by-step operation, or alternatively require extensive external accessibility through a large number of
additional microprocessor pins. Both approaches are expensive. Moreover, it is difficult on an organizational
level to execute the monitoring in real time in view of the enormous amount of data produced.

SUMMARY OF THE INVENTION

In consequence, amongst other things it is an object of the present invention to diminish the above costly
requirements and improve the functionality through the usage of the so-called boundary scan standard facilities
that currently are being introduced into a large fraction of complex integrated circuits. Now, according to one of
its aspects, the invention is characterized by comprising, attached to said internal bus, a registered boundary
scan standard JTAG interface element that accesses one or more scan chains inside said microprocessor, and
furthermore said JTAG interface element is arranged for controlling DMA-type exchanges via said internal bus with
other elements connected to said internal bus. Boundary scan standard or JTAG Standard has been described
extensively in IEEE Standard 1149.1, and in particular in GB Patent Application 2,195,185 and corresponding US
Application Serial No. 07/90489 to the present assignee. Originally, the Standard was conceived to facilitate
board level testing, but it offers many advantages on the level of a single integrated circuit as well, and in
consequence will hereinafter be called JTAG standard for brevity. According to the Standard the minimum test
interface has one serial data input pin, one serial data output pin, a test clock pin, and a test control pin. An
additional reset pin is optional. According to the Standard, under external synchronization, first a control
pattern is loaded into a circuit, which pattern may be used as well for addressing the circuit in question, and so
provides test initialization. According to the Standard, next the test pattern is loaded in an input register.
After a brief interval of normal operation of the circuit, the test result is outputted from an output register,
which may be overlaid with entering the next test pattern. Various additional features have been proposed. A
particular feature of the Standard is the one bit bypass connection between serial input and serial output. For
executing the debugging, one or more data registers are interconnected between the on-chip bus and the serial JTAG
test interface. In this way, only the four or five additional pins proper to the JTAG interface must be added to
the normal circuit functionality. By itself, the Direct Memory Access or DMA feature is a standard functionality
in a microprocessor environment.

Computer Design, vol. 32, Jan 1993, pages 65-74; Marrin K.: "DSP development tools engage mainstream designers"
describes debug enhancements in that JTAG debug ports, which are used not only for boundary scan test but also to
monitor and control on-chip resources, are added to DSPs.

According to the invention, the JTAG interface can now access various scan chains that consist of serialized
flip-flops. Moreover, via a number of appropriately arranged ones of these scan chains the JTAG interface can
execute direct memory access DMA to all on-chip functional units that are connectable as slaves to the on-chip bus.
For example, JTAG may access RAM memory for effecting LOAD, CHANGE, INSPECT, BOOT and other operations. Likewise,
it may access ROM memory. Likewise, it may access breakpoint registers for effecting SET and INSPECT operations.
Likewise, it may access an external event trace buffer memory for recording instructions. Likewise, it may access
various other elements, such as counters, timers, FIFO storage, control registers, and other elements, according
to the needs of a designer. It is noted that all of these elements can also be reached by the software, so that
accessing these elements by the JTAG interface is an excellent mechanism for allowing debugging, tracing, and other
test support mechanisms. As a consequence of the above, the processor element need not halt its operation during
communication between the debug support element and the various scan chains. Advantageously said JTAG interface
element allows downloading information communicated by an external station. This downloading process is based on
the peek and poke primitives, known from their widespread usage in various computer languages, for so accessing
the memory and filling it quickly with new information.

Advantageously, the debug support element is, externally to the internal bus, directly connected to the processor
element. This close interconnection allows for easy scrutinizing without necessitating bus cycle time: in this
way, a trace buffer located in the debug support element can be filled directly from the processor element.

Advantageously, the JTAG interface directly accesses one or more breakpoint registers. This allows these registers
to be loaded at run time for subsequent evaluation while maintaining standard run speed.

Advantageously, the debug support element contains an internal event trace buffer memory for accommodating 
a restricted set of contents of non-sequential addresses as generated by the processing element and allowing at least 
one of the following storage modes for a limited time operation of said microprocessor: storing of all non-sequential 
addresses, and/or storing of all call, jump, and trap addresses, or any appropriate selection or part of these. This 
represents a broad spectrum of debug operations that need only little hardware, inasmuch as only an appropriate 
fraction of the addresses, and in particular the relatively most critical ones of these are retained. 

Further advantageous aspects of the invention are recited in dependent Claims. The invention in particular has 
been considered as an advantageous functionality extension for SPARC microprocessors, although its application is 
not limited to this particular type. 

BRIEF DESCRIPTION OF THE DRAWINGS 

These and other aspects and advantages will be discussed more in detail with reference to the disclosure of 
preferred embodiments hereinafter, and in particular in and by the appended Figures that show: 

Figure 1 a microcontroller with hardware for event tracing; 
Figure 2 a tracing example; 
Figure 3 a JTAG boundary scan block in the architecture;

Figure 4 an embodiment of serial event output facilities; 
Figure 5 an embodiment of a basic serial out protocol. 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

Figure 1 shows a debugging environment with a microcontroller or microprocessor provided with the basic hardware
for event tracing. Microcontroller 20 sits on target board 24 together with its system RAM memory 26. The
microcontroller 20 is provided with a boundary scan or JTAG debug interface 46 that via JTAG board connector 28 and
interconnection 34 is connected to host workstation 32 that itself has a JTAG interface card 30. The latter can
interface to a plurality of JTAG interconnections such as 36. Microcontroller 20 has an on-chip system bus 48 that
interconnects various subsystems, such as JTAG debug interface 46, a debug support unit 56 with an on-chip trace
memory 58, processing element with caches 60, memory interface 62, and various further unnamed subsystems 50-54,
such as an IO interface element. Via interconnection 64, debug support unit 56 is connected to a test probe or
logic analyzer that has a serial-to-parallel converter 42, an event trace memory 40, and receives a time stamp from
a source not shown. The event trace memory 40 is connected to host workstation 32 via interconnection 36, which may
be JTAG based. System RAM 26 contains a debug section 44 that is addressed by a symbolically indicated debug trap
vector. Finally, there is a direct interconnection 57 between the debug support unit 56 and JTAG interface 46.

Serial output 64 has a one-bit-wide data path plus CLKOUT and provides real-time information on the occurrence of
software or hardware triggered events. For example, it shows the general flow of software, identifies task latency
in a multitasking system, identifies software sections that are of special interest for debugging, and triggers
external logic analyzer hardware under software control. According to the invention, the event trace facility does
not need to provide trace reconstruction with instruction address granularity, but it gives a good overview of the
time behaviour of embedded real-time software. It is useful for timing analysis and performance measurement.

The serial event information is a 0..16 bit data packet plus start and stop condition signals. Together with a
time stamp, the parallelized data packet is stored in the event trace memory 40, which has a cycling address
counter. The CLKOUT signal, not separately shown, provides synchronization, and may be a subharmonic of the
microcontroller clock. Events may without limitation be triggered by three causes: execution of a special
non-privileged instruction, entry of traps and/or interrupts, and a matchpoint occurring at a match by a debug
condition compare register. In the realization with the SPARC microcontroller the as yet unused and unprivileged
instruction WR ASR31 is used as event trigger; its opcode field accommodates a 13 bit immediate operand. Its value
is defined either manually during software development, or automatically at compilation time via a
prologue/epilogue mechanism for every individual subroutine. Execution of the WR ASR31 instruction occupies the IU
pipeline for only one clock cycle. In many cases this is so little that the debug instructions need not be removed
from the final code. Another hardware platform would need a similar feature.

The above explains the double usage of the JTAG interface for debugging. Two optional pins may be added for
enhanced functionality. The double usage of the interface needs only little extra on-chip hardware area for
implementing basic features, but may be expanded in a straightforward way. Basic features are the following:

- JTAG provides straightforward host-target communication
- Provision of software breakpoints
- Hardware single-step
- External break request and reset control from the host.

A few useful extensions are:

- Hardware breakpoints on instruction address, data address, data store value and range
- On-chip memory for instruction address trace at various different levels of trace granularity
- Serial real-time event trace output facility including event identification.

The JTAG debug interface block at the target processor side provides the following internal data registers for
debug communication purposes that can be read and written bit-sequentially by the host system: DMA_ADDR; DMA_DATA;
DMA_CONTROL_STATUS. By means of these registers, the interface block provides a bridge functionality between the
external JTAG bus and an on-chip system bus. Any memory-mapped slave-type device connected to the latter bus system
can be accessed via the bridge from the host system. Thus the host is able to perform direct memory accesses (DMA)
to any memory-mapped resource of the target system. The DMA_ADDR register must be initialized with the target
address before a DMA access. The DMA_DATA register must be initialized with write data before a write access can be
initiated. After a DMA read access, the DMA_DATA register contains the read data. The DMA_CONTROL_STATUS register
provides the following control functions:

- select DMA access type (read/write, ASI-control space, byte-halfword-word selection)
- start the DMA access
- lock the system for exclusive JTAG usage of the DMA feature
- auto-incrementing of the DMA_ADDR register contents
- force a system RESET
- issue an external debug break or trap request
- handshake flags for host-target monitor communication protocol.
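
The order of register loads described above can be illustrated with a minimal host-side sketch. It assumes a toy in-memory stand-in for the bridge: the class `FakeBridge` and its methods `scan` and `start` are hypothetical names introduced only for illustration and are not part of the described hardware, and the busy/handshake flags are left out.

```python
class FakeBridge:
    """Hypothetical software stand-in for the JTAG-to-bus bridge (illustration only)."""

    def __init__(self):
        self.regs = {"DMA_ADDR": 0, "DMA_DATA": 0}
        self.memory = {}  # models memory-mapped slaves on the on-chip bus

    def scan(self, name, value):
        # Models shifting a 32-bit value into one of the JTAG data registers.
        self.regs[name] = value & 0xFFFFFFFF

    def start(self, read):
        # Models the "start the DMA access" control function.
        addr = self.regs["DMA_ADDR"]
        if read:
            self.regs["DMA_DATA"] = self.memory.get(addr, 0)
        else:
            self.memory[addr] = self.regs["DMA_DATA"]


def dma_write(bridge, addr, data):
    bridge.scan("DMA_ADDR", addr)   # target address first
    bridge.scan("DMA_DATA", data)   # write data before the access is started
    bridge.start(read=False)


def dma_read(bridge, addr):
    bridge.scan("DMA_ADDR", addr)
    bridge.start(read=True)
    return bridge.regs["DMA_DATA"]  # read data is found in DMA_DATA afterwards
```

A write followed by a read of the same address returns the written word, which mirrors the peek and poke style of access mentioned in the summary.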

In this manner a cooperation between the communication protocol and the direct memory access facility has been
realized. The registers have been well-defined and geographically clustered. The DMA is synchronized by a clock TCK
that differs from the system clock, and its register accessing can be executed even if the system clock is
unavailable, such as in standby state. Also the test clock TCK may be much slower than the system clock; cf.
Figure 3, nos. 96, 98. These organizational aspects facilitate designing the control software, because all
relevant information is now present inside the DMA scan chain.

The following status information is visible to the host system when reading the DMA_CONTROL_STATUS register: 

DMA busy and handshake flags for bidirectional host<->target monitor communication protocol
break status and status flags indicating states such as processor error or power down. 

If the JTAG interface is indeed provided with an independent TRSTN reset line, it is possible to reset the DMA
registers while the target system is kept running. An even more interesting operating mode is keeping the processor
in reset and initializing the JTAG facilities through loading appropriate registers. Now, the following exemplary
subsystems can be accessed through JTAG DMA access:

interface to an external memory that becomes JTAG-accessible 
IO interface that renders IO devices JTAG-accessible 

internal memory such as RAM, ROM, Cache and MMU Translation lookaside buffer TLB 
internal registers such as timers, counters, interrupt and application cell functions 
debug support registers, such as for break-point match and application cell functions. 

All these subsystems are memory-mapped and can be accessed directly from the target system without processor
intervention. The DMA controlled by the JTAG interface is an extremely straightforward vehicle. The processor
element can even continue executing any ongoing program. Furthermore, because JTAG needs only brief accesses, the
system bus remains largely available for other stations intending to be bus master. Finally, uploading and
downloading between internal target system and external target system is fast: for example <5 secs for 1 MByte data
at a 10 MHz test clock. Internal processor registers, such as the register file(s) or the Ancillary State Registers
ASR that have been defined in the SPARC microcontroller, cannot be directly accessed through JTAG, but only by
monitor software.

EP 0 636 976 B1

For this purpose, the host system can request a debug break via the JTAG DMA_CONTROL_STATUS register, thus forcing
the processor to enter the monitor program. The data exchange between the monitor program that runs on the target
system, and the host computer is performed via memory locations that are accessible via the JTAG DMA, and
associated input and output buffers. The communication protocol has a handshake.
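
The quoted transfer figure (under 5 seconds for 1 MByte at a 10 MHz test clock) can be sanity-checked with a rough calculation. The sketch below assumes, purely for illustration, that each 32-bit word costs one scan of the 78-bit DMA_ALL chain listed in the instruction table of Figure 3, and it ignores TAP state transitions, instruction scans and bus latency; the function name `transfer_seconds` is hypothetical.

```python
def transfer_seconds(n_bytes, tck_hz, bits_per_word_scan=78, bytes_per_word=4):
    """Lower-bound estimate: one full scan of the DMA chain per transferred word."""
    words = n_bytes // bytes_per_word
    return words * bits_per_word_scan / tck_hz


# 1 MByte at a 10 MHz test clock
estimate = transfer_seconds(1 << 20, 10_000_000)
```

Under these assumptions the pure shifting time comes to roughly 2 seconds, which leaves margin for protocol overhead within the stated 5 second bound.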
As shown in the debug configuration system of Figure 1, the system RAM has a small section allocated for debug
support. This section contains

- a debug trap handler program that provides a link to the monitor program
- either a full monitor program, or only those monitor command routines that may execute one command at a time
- buffers for communicating command + parameters and response between host and target monitor
- direct accessibility from the host via JTAG DMA.

These features have cost advantages in that no extra pins are needed: no dual port is necessary that should be
accessible from both host and target processor, and no time-critical multiwire link is necessary from target
processor to a remote debug RAM. During the debugging process no special firmware, like a boot-PROM on the target
board, is required. Debug trap handlers and monitor program are downloaded via JTAG into system RAM before the
target processor starts program execution.

An extra problem for real-time instruction tracing in RISC type processors is that in each clock cycle one or more
instructions are fetched from an internal cache memory whose addresses are not visible at the chip boundary. To
solve the problem a limited size on-chip trace memory of 32 entries has been provided that loads internal addresses
and can be read by the host via the JTAG facilities. The small size of the trace memory necessitates scrupulous
assigning of its capacity. Various trace modes are: load all addresses; load all non-linear addresses, that is,
those that are other than the simple increment-by-one; and load only addresses following call, trap or jump
instructions. A further usage is to start loading upon reaching a preset breakpoint, and subsequently loading all
addresses until the trace memory is full. Various combinations of the above are also useful.
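
The "non-linear addresses" trace mode can be sketched in software as a filter in front of a 32-entry buffer. This is a minimal illustration, not the on-chip logic: the function name `trace_nonlinear` is hypothetical, and two assumptions are made that the text does not state, namely that the very first address is always recorded and that the oldest entries are overwritten once the buffer is full.

```python
from collections import deque

TRACE_DEPTH = 32  # from the text: on-chip trace memory of 32 entries


def trace_nonlinear(addresses, step=1, depth=TRACE_DEPTH):
    """Keep only addresses that are not the previous address plus `step`."""
    buf = deque(maxlen=depth)  # assumption: oldest entries are overwritten when full
    prev = None
    for addr in addresses:
        # assumption: the first address seen is recorded as a starting point
        if prev is None or addr != prev + step:
            buf.append(addr)
        prev = addr
    return list(buf)


# Sequential fetches 100..102, a jump to 200, one sequential fetch, a jump to 50
recorded = trace_nonlinear([100, 101, 102, 200, 201, 50])
```

Here only the entry points 100, 200 and 50 are retained, which shows how a small buffer can still cover a long stretch of execution.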

The serial event output facility 64 is used for time stamping of the entries in the external trace memory 40. The
event output provides information in real-time on the occurrence of software- or hardware-triggered events. Its
main applications are to:

- show the general flow of software
- identify interrupt latencies in a system
- identify task activity in a multitasking system
- identify software sections that may need special debugging
- trigger an external analyzer under software control.

A relatively small trace buffer, although not providing trace reconstruction with instruction address granularity, 
generally will give an excellent overview on the time behaviour of embedded real time software, which is useful for 
performance measurement and timing analysis. The required facilities are a serial parallel converter 42, a trace memory 
40 of appropriate capacity, and a timestamp generator mechanism as indicated. 

As shown in the example of Figure 2, trigger instructions are preferably placed at strategic positions, such as
trap exits, subroutine entries and exits, and jump table targets. In Figure 2 time runs horizontally. The solid
steps show machine activity that alternates between main program level 78, subroutine levels 72, 74, 76, and trap
routine level 70. Trace 80 symbolizes the serial event output data through blocks. Here, these occur at start, at
entrance and exit of all subroutines, at watchpoint hit (diamond at level 76) and finally at breakpoint hit
(level 78). A WR ASR31 instruction is shown by a small circle and a trap is symbolized by a block. The span of
coverage of the internal trace buffer is indicated at level 82. When using a watchpoint hit, the match that
occurred causes a debug trap to the processor, which then must execute a trap handler. Subsequently, it will wait
for commands from the host. This is communicated to the host via a status register contained in the JTAG interface.

Figure 3 illustrates the JTAG boundary scan block inclusive of its various operating modes in the processor
architecture. For brevity, the JTAG facilities are only recited without further detail. The five pins at bottom are
test clock TCK, test reset TRST, test mode select TMS that controls the various modal transitions in TAP controller
90, test data in TDI, and test data out TDO. The port acts as a DMA master on the internal bus shown as a heavy
line at the top; the port may access any slave connected to the bus, even while the processing element is
executing. During arbitration, the port has highest priority. The DMA operation is initiated via the boundary scan
external interface, such as by an external work station. The boundary scan facilities number a device ID register
106. Clocked instruction register 104 loads from TDI and accommodates five-bit instructions. The following
instructions are used:
Instruction   Mnemonic         Register length   Function
00000         EXTEST           32                Select boundary scan register
00001         IDCODE           32                Chip identification code
00010         SAMPLE/PRELOAD   32                Select boundary scan register
01000         MACRO            1                 Macro test mode, enables production test
10000         DMA_ADDR         32                DMA address register
10001         DMA_DATA         32                DMA data register
10010         DMA_CNTL_STAT    14                DMA control/status register
10011         DMA_ALL          78                all DMA registers in one chain
11111         BYPASS           1                 bypass mode

In the above, the third column gives the applicable register length. For simplicity, the bypass itself has not
been shown. In addition to the above, the Figure has DMA control elements and further output multiplexer 108 that
feeds result data output line TDO.
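
As a cross-check on the DMA-related entries of the instruction list, the opcodes and register lengths can be held in a small lookup structure. The sketch below is illustrative only: the names `DMA_REGISTERS`, `DMA_ALL` and `ir_bits` are hypothetical, and the check simply confirms that the combined chain length equals the sum of the three individual DMA registers.

```python
# Opcode and data-register length for the DMA-related instructions, as listed above.
DMA_REGISTERS = {
    "DMA_ADDR": (0b10000, 32),
    "DMA_DATA": (0b10001, 32),
    "DMA_CNTL_STAT": (0b10010, 14),
}
DMA_ALL = (0b10011, 78)  # all DMA registers in one chain


def ir_bits(opcode, width=5):
    """Render an opcode as the five-bit pattern held by the instruction register."""
    return format(opcode, f"0{width}b")


# 32 + 32 + 14 = 78: DMA_ALL is exactly the concatenation of the three registers
total_length = sum(length for _, length in DMA_REGISTERS.values())
```

Selecting DMA_ALL therefore lets a host update address, data and control in a single data-register scan instead of three separate instruction and data scans.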

The instruction register is implemented in two parts: a parallel load register 102 and a shift register 104. While
a new instruction is received via input TDI, the parallel register retains the value of the preceding instruction.
When TAP controller 90 subsequently enters the Update_IR state, the contents of the shift register are transferred
in parallel to the parallel load register, and become the new instruction. For example, the instruction DMA_ADDR
will load 32 bits from TDI into the JTAG data register 'DMA_ADDR'. At DR-Update, this data is available as device
address on the internal bus from register 94. Register 94 has a count input with a +4 increment as shown and an
effective count range of six bits; the two least significant bits are not used. It is controlled by signal
DMA_ADDR_INC from DMA control register 100: this allows up to sixty-four consecutive addresses to be DMA-ed without
needing to reload DMA_ADDR. DMA data register 96 is used to store data communicated to or from the slave thus
addressed. Write data must be loaded before the DMA_Start request is given. Read data is captured under
synchronization from the system clock in register 98. At Capture_DR this data is transferred from register 98 to
register 96 and shifted out serially via multiplexer 108 and TDO.
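
The auto-increment behaviour of register 94 can be modelled in a few lines. This sketch assumes that the six counting bits are address bits [7:2], so that the +4 addition wraps within a 256-byte window while the upper 24 bits and the two least significant bits stay unchanged; that reading is an interpretation of the text, and the function names are hypothetical.

```python
def next_dma_addr(addr):
    """Add 4 within the low 8 bits only (six counting bits plus two unused LSBs)."""
    low = (addr + 4) & 0xFF        # modulo-256 addition; bits [1:0] are preserved
    high = addr & 0xFFFFFF00       # assumption: upper 24 bits are not affected
    return high | low


def addresses_without_reload(start):
    """List the word addresses reachable before the counter wraps back to `start`."""
    seen = [start]
    addr = next_dma_addr(start)
    while addr != start:
        seen.append(addr)
        addr = next_dma_addr(addr)
    return seen


window = addresses_without_reload(0x1000)
```

The window contains 64 word addresses, matching the stated limit of sixty-four consecutive accesses before DMA_ADDR has to be reloaded.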

DMA_CNTL_STAT register 100 has 14 defined bits, as follows: 

[0]: SYS_CLK_ON indicates the status of the system clock. When system clock is active, this bit = 1.

[1]: TOF_CONTL_STAT gives the control and status of the Test Output Full flag in the DSU DSTAT control register.
The TOF flag will not be cleared if this bit is set, but the DMA access is terminated by a bus error.

[2]: TIF_CNTL_STAT gives the control and status of the Test Input Full flag in the DSU DSTAT control register.
The TIF flag will not be set if this bit is set, but the DMA access is terminated by a bus access. Read: the status
of the TIF flag in DSU DSTAT control; write: 1 = set TIF bit in DSU STAT control register after DMA is finished, 0 =
no change. The two bits TIF_CONTL_STAT and TOF_CONTL_STAT together represent a handshake for the communication
protocol.

[3]: BRK_STAT read-only: 1 = IU is in break state, 0 = IU is not in break state.

[4]: DMA_ADDR_INC write only: 1 = add 4 to DMA_ADDR data register before starting the DMA, 0 = no change.
The addition is modulo-256. The DMA_ADDR is incremented each time an update of the DMA_CNTL_STAT is
done with the DMA_ADDR_INC bit set to 1. The two least significant bits (1-0) will not change.

[5]: DSU_CNTL_STAT: DMA_LOCK. Lock the PI (processor-internal) core bus if a DMA is started. This can be
used to lock the PI core bus if more than one atomical DMA access is required. The last DMA access must have
DMA_LOCK = 0 to unlock the PI core bus. Write-only: 1 = lock PI core bus; 0 = no lock.

[6]: DSU_CNTL_STAT: DMA_CSP. Do a control space DMA access. This bit allows the JTAG/Test module to read/write
ASI mapped resources on the PI core bus. Address bits DMA_ADDR [31:24] indicate the control space (=ASI)
identifier. The low 24 bits (DMA_ADDR [23:0]) form the address within the control space. Write only: 1 = control
space DMA, 0 = no control space DMA.

[8:7]: DMA_CNTL_STAT: DMA_SZ [1:0]. The two bits indicate the data size of the DMA: write only: 00 = byte, 01
= half word, 10 = word, 11 = illegal.




[9]: DMA_CNTL_STAT: DMA_RWN. The direction of the DMA access. Atomic read/writes can be done by setting
the DMA_CNTL_STAT [5] bit (lock the PI core bus after the read). Write only: 1 = Read access, 0 = Write access.

[10]: DMA_CNTL_STAT: DMA_ERR. This bit indicates that a bus error has occurred during the DMA. This bit is
only valid if the DMA has been finished (DMA_START_BUSY = 0). Read only: 1 = error, 0 = no error.

[11]: DMA_CNTL_STAT: DMA_START_BUSY. This bit controls the start of the DMA and signals if the DMA has
been finished. The DMA_START_BUSY has to be written 0 to allow the read value to go to 0 (= DMA finished).
Setting DMA_START_BUSY will clear this bit. Read: 1 = DMA busy, 0 = DMA finished or not started. Write: 1 =
start DMA, 0 = no change. The basic loop for starting DMA is:

write DMA_CNTL_BUSY = 1
wait till DMA_CNTL_BUSY = 1 (DMA access is started)
write DMA_CNTL_BUSY = 0
wait till DMA_CNTL_BUSY = 0 (DMA access is finished)

The read DMA_CNTL_BUSY value will not change from 1 to 0 before a write 0 to DMA_CNTL_BUSY has been
effected when the read DMA_CNTL_BUSY = 1. This is to ensure that the software has detected a '1' (DMA started)
before subsequently the 'DMA finished' is signalled.
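
The four-step start loop and the rule that the busy flag cannot fall back to 0 before the host has written 0 can be exercised against a toy model. The class `BusyFlag` below is a hypothetical simulation written only for this illustration: it treats each read as one elapsed time step and is not a description of the actual circuit.

```python
class BusyFlag:
    """Toy model of the start/busy bit: stays 1 until the DMA is done AND the host wrote 0."""

    def __init__(self, steps_to_finish=3):
        self.steps_to_finish = steps_to_finish
        self.remaining = 0
        self.read_value = 0
        self.acknowledged = False

    def write(self, bit):
        if bit == 1:                       # start the DMA
            self.remaining = self.steps_to_finish
            self.read_value = 1
            self.acknowledged = False
        else:                              # host confirms it has seen the '1'
            self.acknowledged = True

    def read(self):
        if self.remaining > 0:
            self.remaining -= 1            # assumption: one time step per poll
        if self.remaining == 0 and self.acknowledged:
            self.read_value = 0
        return self.read_value


def run_dma(flag, max_polls=100):
    """Follow the basic loop; return how many polls the final wait needed."""
    flag.write(1)
    for _ in range(max_polls):
        if flag.read() == 1:               # wait till busy = 1 (started)
            break
    else:
        raise TimeoutError("DMA did not start")
    flag.write(0)
    for polls in range(1, max_polls + 1):
        if flag.read() == 0:               # wait till busy = 0 (finished)
            return polls
    raise TimeoutError("DMA did not finish")
```

Even if the modelled transfer finishes immediately, the flag reads 1 until the host writes 0, so a slow host polling over JTAG cannot miss a short DMA access.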

[12]: DMA_CNTL_STAT: JTAG_BREAK. Write only: 1 = generate break trap, 0 = no break trap. With this bit a
break trap can be generated in the IU.

[13]: DMA_CNTL_STAT: JTAG_RESET. Write only: 1 = reset, 0 = no reset. With this bit the circuit can be reset via
the JTAG interface.

Some example values for the DMA_CNTL_STAT register contents:

00 1001 0001 0100: Start DMA write, size is word, no control space, no bus lock, increment DMA_ADDR before DMA is
started and set TIF flag in DSU_STAT control register if DMA is ready.

00 1010 0110 0010: Start DMA read in control space, size = byte, lock the bus for further DMA transfers, clear TOF
flag when DMA is finished.

10 0000 0000 0000: RESET the circuit.

01 0000 0000 0000: generate a break trap in the integer unit. DMA_CNTL_STAT[3] can be used to check if the break
trap has indeed been taken.
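
The example values can be reproduced from the bit positions given in the register description. The following sketch is a small encoder written for illustration: the dictionary keys and the helper names `control_word` and `fmt` are hypothetical shorthand, while the bit numbers are those listed above.

```python
# Bit positions taken from the DMA_CNTL_STAT description; key names are shorthand.
BIT = {"TOF": 1, "TIF": 2, "ADDR_INC": 4, "LOCK": 5, "CSP": 6,
       "RWN": 9, "START": 11, "BREAK": 12, "RESET": 13}
SIZE = {"byte": 0b00, "halfword": 0b01, "word": 0b10}  # field [8:7]


def control_word(size="byte", **flags):
    """Assemble a 14-bit control word from a size and a set of flag names."""
    word = SIZE[size] << 7
    for name, enabled in flags.items():
        if enabled:
            word |= 1 << BIT[name]
    return word


def fmt(word):
    """Format as 2 + 4 + 4 + 4 bits, most significant bit first, as in the examples."""
    s = format(word, "014b")
    return f"{s[:2]} {s[2:6]} {s[6:10]} {s[10:]}"


write_word = control_word("word", START=True, ADDR_INC=True, TIF=True)
read_byte_csp = control_word("byte", START=True, RWN=True, CSP=True, LOCK=True, TOF=True)
```

Formatting `write_word` and `read_byte_csp` yields the first two bit patterns of the examples, and setting only RESET or BREAK yields the last two.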

The PI core bus has been shown by the heavy line at the top of the Figure. Further subsystems of the arrangement
are JTAG controller module 90, called TAP controller, with test clock, test reset, and test mode inputs, DMA
control module 92, registers DMA_ADDR 94 and DMA_DATA 96, clock register SS_CLOCK 98, register DMA_CNTL_STAT 100,
Information holding register and decode 102, JTAG information register 104, device ID register 106, and output
multiplexer 108. One of the inputs to the latter is JTAG chain input 10B. Various subsystems as shown can be loaded
from the serial data in chain TDI. For brevity, the hardware particular to the JTAG standard interface has only
been sketched in a summary way.

Figure 4 shows an example of hardware facilities for serial event output. In the setup, the central PI Bus is
attached to various subsystems, and surrounded by a few non-connected subsystems. These are as follows:

- the processor clock 120
- the central reset facility 122
- the bus control unit 124
- interrupt controller 126
- instruction requester (cache), symbolized 128
- memory management facility 130
- data requester (cache), symbolized 132
- debug support unit 134
- serial event output facility 136
- various unnamed further subsystems 138
- memory interface element 140
- JTAG boundary scan facilities 142 including TAP controller
- integer processing unit 144
- optional floating point unit 146.

Most of the above subsystems may be of standard functionality. For simplicity, the direct interconnection between
the debug support unit and the JTAG interface (item 57 in Figure 1) has not been shown here. It is used to directly
and quickly communicate breakpoint hit information into the DMA_CNTL_STAT register. Via polling via the PI bus this
would have taken much longer, but now it can be effected by means of the BRK_STAT functionality discussed with
respect to Figure 3.

Figure 5 shows an example of a basic serial out protocol that may occur on line 64 in Figure 1. The protocol leans
somewhat on the well known I2C protocol described in EP Patent Application 51332. During standard data transfer,
transitions on the single data line (lower trace) may occur when the clock (upper trace) is low. If the clock is
high, no such transition is allowed. Start condition 152 and stop condition 154 violate these prescriptions through
a data transition when the clock is high, and so realize their intended operations. Outside of the data transfer
interval, the clock trace may continue. In the present example the serial event outputting has been caused by the
execution of the brief instruction WR ASR31. Just as in I2C, the event identifications are coded in a range from 0
to 2^13 by the number of 13 bits. For enhancing transfer speed, leading zeroes are suppressed. In this way, also a
flexible packet length is realized. When no serial event information is output, the EVENT_OUT line is constantly
held at '1' level. During transmission, the first bit indicates the packet type, and is followed by data.
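
The effect of leading-zero suppression on packet length can be illustrated with a short encoder and decoder. The text does not specify the bit order or the values of the packet type bit, so the sketch assumes most-significant-bit-first transmission and a type bit of 0 for event packets; the function names are hypothetical, and packet boundaries are taken to be given by the start and stop conditions.

```python
def encode_event(event_id, packet_type=0):
    """Return the bit list between start and stop condition for a 13-bit event id."""
    if not 0 <= event_id < (1 << 13):
        raise ValueError("event id must fit in 13 bits")
    # Leading zeroes are suppressed; an id of 0 carries no data bits at all.
    payload = [int(b) for b in format(event_id, "b")] if event_id else []
    return [packet_type] + payload       # assumption: type bit first, then MSB-first data


def decode_event(bits):
    """Inverse of encode_event; the packet length is known from start/stop framing."""
    packet_type, payload = bits[0], bits[1:]
    value = 0
    for bit in payload:
        value = (value << 1) | bit
    return packet_type, value


short_packet = encode_event(5)       # small ids give short packets
long_packet = encode_event(8191)     # largest 13-bit id
```

A small identifier such as 5 occupies four bit times, whereas the largest identifier needs fourteen, so frequently used events are best given low numbers.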

The information received at the probe 38 in Figure 1 is parallelized and stored together with a time stamp taken 
at the start condition in the external event trace memory. The CLKOUT signal from the target processor is required for 
synchronization purposes. Bits on the serial signal output are synchronous to CLKOUT normally running at system 
clock speed. In case of very high system clock frequency it may be required to perform the serial output at a subharmonic 
of the system clock. Facilities on the target board allow easy connection to a probe or logic state analyzer. 

Recapitulating, the JTAG facilities are enhanced by adding DMA registers for communicating between JTAG and 
the on-chip system bus. In particular DMA may function as bus master. Seen from the debug support unit DSU, an 
instruction trace memory has been added on-chip, and is provided with a serial event output. Finally, an external event 
trace buffer has been added that is driven by a serial event output. 

Claims 

1. A microprocessor (20) comprising a processor element (60), a memory interface element (62), an IO interface 
element, a debug support element (56) and an internal bus (48) interconnecting all above elements, characterized 
by comprising attached to said internal bus a registered boundary scan standard JTAG interface element (46) that 
accesses one or more scan chains inside said microprocessor, and furthermore said JTAG interface element is 
arranged for controlling DMA-type exchanges via said internal bus with other elements connected to said internal 
bus. 

2. A microprocessor as claimed in Claim 1, wherein said JTAG interface element (46) allows bidirectional downloading
of information with respect to an external station (32).

3. A microprocessor as claimed in Claims 1 or 2, wherein said debug support element (56) is directly connected to
said processor element (60), externally to said internal bus (48).

4. A microprocessor as claimed in Claims 1, 2 or 3, wherein the JTAG interface element (46) directly accesses one
or more breakpoint registers.

5. A microprocessor as claimed in any of Claims 1 to 4, wherein the debug support element (56) directly accesses
a trace buffer (40) external to the microprocessor (20).

6. A microprocessor as claimed in any of Claims 1 to 5, wherein said debug support element (56) contains an internal
buffer memory (58) for accommodating a restricted set of contents of non-sequential addresses as generated by
the processing element and allowing at least one of the following storage modes for a limited time operation of
said microprocessor: storing of all non-sequential addresses, and/or storing of all call, jump, and trap addresses,
or any appropriate selection or part of these.

7. A microprocessor as claimed in Claim 6, wherein said debug support element (56) interfaces externally to the
microprocessor via a serial clocked data path (64) to an external event trace buffer memory (40) for outputting
thereto event signalization for storage.

8. A microprocessor as claimed in Claim 7, wherein a special instruction (WR ASR31) is arranged for controlling 
loading of said external event trace buffer memory. 

9. A microprocessor as claimed in Claims 7 or 8, wherein the external trace buffer memory (40) is arranged for
recording, together with any word recorded therein, an actual time stamp indication.

10. A microprocessor as claimed in Claims 7, 8 or 9, wherein the external trace buffer memory (40) is arranged for
outputting data (36) via a further standard interface to a standard workstation or personal computer (32).



Patent claims

1. Microprocessor (20) having a processor element (60), a memory interface element (62), an IO interface
element, a debug support element (56) and an internal bus (48) interconnecting all of the above elements,
characterized by comprising, attached to said internal bus, a registered JTAG interface element (46) of the
boundary scan standard that accesses one or more scan chains inside said microprocessor, and furthermore in that
said JTAG interface element is arranged for controlling DMA exchanges via said internal bus with other elements
connected to said internal bus.

2. Microprocessor as claimed in Claim 1, wherein said JTAG interface element (46) allows bidirectional downloading
of information with respect to an external station (32).

3. Microprocessor as claimed in Claim 1 or 2, wherein said debug support element (56) is directly connected to
said processor element (60), externally to said internal bus (48).

4. Microprocessor as claimed in Claims 1, 2 or 3, wherein the JTAG interface element (46) directly accesses one or
more breakpoint registers.

5. Microprocessor as claimed in any of Claims 1 to 4, wherein the debug support element (56) directly accesses
a trace buffer (40) external to the microprocessor (20).

6. Microprocessor as claimed in any of Claims 1 to 5, wherein said debug support element (56) contains an internal
buffer memory (58) for accommodating a limited set of contents of non-sequential addresses as generated by the
processing element, and allows at least one of the following storage modes for a limited time operation of said
microprocessor: storing of all non-sequential addresses and/or storing of all call, jump and trap addresses, or
an appropriate selection or part thereof.

7. Microprocessor as claimed in Claim 6, wherein said debug support element (56) is coupled, externally to the
microprocessor, via a serial clocked data path (64) to an external event trace buffer memory (40) in order to
output thereto event signalization for storage.

8. Microprocessor as claimed in Claim 7, wherein a special instruction (WR ASR31) is arranged for controlling
loading of said external event trace buffer memory.

9. Microprocessor as claimed in Claims 7 or 8, wherein the external trace buffer memory (40) is arranged for
recording, together with any word recorded therein, an actual time stamp indication.

10. Microprocessor as claimed in Claims 7, 8 or 9, wherein the external trace buffer memory (40) is arranged for
outputting data (36) via a further standard interface to a standard workstation or a personal computer (32).

Claims

1. Microprocessor (20) comprising a processor element (60), a memory interface element (62), an I/O interface
element, a debug support element (56) and an internal bus (48) connecting all of the above-mentioned elements,
characterized in that it comprises, associated with said internal bus, a registered boundary scan standard JTAG
interface element (46) that accesses one or more scan chains inside said microprocessor, and in that, furthermore,
said JTAG interface element is configured to control DMA-type exchanges via said internal bus with other elements
connected to said internal bus.

2. Microprocessor as claimed in Claim 1, wherein said JTAG interface element (46) allows bidirectional downloading
of information with respect to an external station (32).

3. Microprocessor as claimed in Claim 1 or 2, wherein said debug support element (56) is directly connected to
said processor element (60), externally to said internal bus (48).

4. Microprocessor as claimed in Claim 1, 2 or 3, wherein the JTAG interface element (46) directly accesses
one or more breakpoint registers.

5. Microprocessor as claimed in any one of Claims 1 to 4, wherein said debug support element (56) directly
accesses a trace buffer (40) external to the microprocessor (20).

6. Microprocessor as claimed in any one of Claims 1 to 5, wherein said debug support element (56) contains an
internal buffer memory (58) for accommodating a limited set containing non-sequential addresses as generated by
the processor element, and for allowing operation of said microprocessor for a limited time in at least one of
the following storage modes: storing of all non-sequential addresses and/or storing of all call, branch and trap
addresses, or any other appropriate selection or part thereof.

7. Microprocessor as claimed in Claim 6, wherein said debug support element (56) interfaces externally to the
microprocessor via a clocked serial data path (64) to an external event trace buffer memory (40), in order to
output thereto an event indication for storage.

8. Microprocessor as claimed in Claim 7, wherein a special instruction (WR ASR31) is configured to control
loading of said external event trace buffer memory.

9. Microprocessor as claimed in Claim 7 or 8, wherein the external event trace buffer memory (40) is configured
to record, in addition to any word recorded therein, an actual time stamp indication.

10. Microprocessor as claimed in any one of Claims 7, 8 or 9, wherein the external event trace buffer memory (40)
is configured to output data (36), via a further standard interface, to a standard workstation or to a personal
computer (32).